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Abstract — A gambler walks into a hypothetical fair 
casino with a very real dollar bill, but by the time he 
leaves he's exchanged the dollar for a random amount 
of money. What is lost in the process? It may be that 
the gambler walks out at the end of the day, after a 
roller-coaster ride of winning and losing, with his dollar 
still intact, or maybe even with two dollars. But what 
the gambler loses the moment he places his first bet is 
position. He exchanges one distribution of money for a 
distribution of lesser quality, from which he cannot return. 
Our first discussion in this work connects known results 
of economic inequality and majorization to the probability 
theory of gambling and Martingales. We provide a simple 
proof that fair gambles cannot increase the Lorenz curve, 
and we also constructively demonstrate that any sequence 
of non-increasing Lorenz curves corresponds to at least one 
Martingale. 

We next consider the efficiency of gambles. If all fair 
gambles are available then one can move down the lattice 
of distributions defined by the Lorenz ordering. However, 
the step from one distribution to the next is not unique. Is 
there a sense of efficiency with which one can move down 
the Lorenz stream? One approach would be to minimize 
the average total volume of money placed on the table. In 
this case, it turns out that implementing part of the strategy 
using private randomness can help reduce the need for the 
casino's randomness, resulting in less money on the table 
that the casino cannot get its hands on. 

I. Introduction 

What is lost when one gambles? Even in a hypothet- 
ical fair casino with fair bets offered, a gambler trades 
one random variable of wealth for another — an exchange 
that cannot be reversed through subsequent fair gambles. 
A gambler who walks into a fair casino with $1 walks 
out with a random amount of wealth with mean still $ 1 . 
The gambler cannot devise a strategy that allows him to 
recover his $1 with probability one. He can only use fair 
gambles to exchange his random variable for one that is 
further degraded. 

In this paper we identify the Lorenz curve [1] as a 
simple way to characterize which probability distribu- 
tions can be transformed through fair gambles to which 
others — an idea that can be distilled from the literature 
through an investigation of topics such as majorization, 
Schur convexity, second-order stochastic dominance, and 
related ideas in [2]-[13], The partial ordering of non- 
negative, mean-one distributions, which places p\ >z pi 
if px can be the starting point of a fair gambling system 



with p2 the ending point, coincides with the partial 
ordering induced by the associated Lorenz curves. Thus, 
given two distributions p a and pb, we can use the Lorenz 
curve to identify the most degraded distribution from 
which we can gamble to either p a or pb- Similarly, 
we can find the least degraded distribution that can be 
produced from either starting point p a or pb- 

Example 1.1: Consider two different mean-one distri- 
butions p a and pb given by 



Pa(x) 

Pb(x) 



What is the first distribution that can be arrived at by 
starting from either p a or pb using fair gambles? It's not 
the distribution that places all of the mass on and 2. 
The least degraded distribution that they can both arrive 
at is 
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This is identified with ease from the Lorenz curves, 
illustrated in Fig. [4] 
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Fig. 1. Distributions p a and pf,, represented in the top two graphs, 
each have a mean of one, however no system of fair gambles can begin 
at one of these distributions and arrive at the other. The least degraded 
distribution that can be reached from either p a or pb is represented by 
Panb m the bottom graph. 
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Fig. 2. Lorenz curve for 2003 U.S. household income distribution. 



In this paper we show the relationship between Lorenz 
curves and fair gambling and discuss the lattice formed 
by this relationship. This implies a "second law" of 
Martingales, characterized by the Lorenz curve or any 
Schur convex function, similar to entropy increase in 
Markov chains. These relationships also provide a simple 
proof of weak convergence of Martingales. 

We also consider the efficiency of going from one 
distribution to another. For any system of gambles, there 
is an amount of money that may be lost and therefore 
must be placed on the table at a casino. To model the fact 
that casinos actually take a profit by offering games with 
a house edge, we can imagine that any money placed on 
the table is taxed. There are in general many ways of 
gambling from p\ to p2, assuming pi is less degraded 
than p2- We identify a general method for constructing 
the most efficient gambling system. 

II. Lorenz Curve 

A. Definition 

The Lorenz curve is a metric developed by Lorenz in 
1905 for the purpose of measuring economic inequality 
in a population. If the population is ordered according 
to increasing wealth, forming an increasing sequence of 
individual wealths W\, W2, tu/v, then the Lorenz curve 
is obtained from the sequence of partial sums of this 
sequence, U = Y^k=i w k- Normally, the Lorenz curve 
L(u) is scaled so that the domain and range are [0, 1]. 
That is, L(u) is the linear interpolation of the points 

In Figure [2] we see the Lorenz curve for the United 
States of America (based on income rather than wealth). 
We can interpret the curve in the following way. Con- 
sider the point A — (.6, .27). This says that the poorest 
60% of the population account for 27% of the total 
income. 

The Lorenz curve can also be defined for a mean-one 
probability distribution. The interpretation is that each 



quantum of probability corresponds to an individual in 
the economic wealth distribution setting. 

For a continuous random variable X ~ Px{%) we 
define 



L(u) 



min E Xlx<=A- 

AOSt:P(A)=v, 



We can define the Lorenz curve in a way that ac- 
commodates all random variables (continuous, discrete, 
and singular) by expressing it as an optimization over 
all random variables U that are correlated with X. The 
general expression for the Lorenz curve is given by 



L(u) 



min EXl UeA . (1) 

pm x ,A: P(UeA)=u 



For example, we might let U be a random variable 
with a uniform distribution on the interval [0, 1], and let 
g(u) be a function such that X = g(U) ~ px ■ This is in 
fact an optimal choice of U. An example of a function 
that would work would be the inverse of the cumulative 
distribution function which we define by 



F-\u) 



inf{a; : F(x) > u}. 



A straightforward way of calculating the Lorenz curve 



L(u) = / F-^^dr. 



B. Properties 

1) L(0) = 0. 

2) L(l) = 1. 

3) L(u) is a continuous, convex function. 

4) < L(u) <u, Vu e [0,1]. 

Any function with these properties is a Lorenz curve 
and uniquely specifies a mean-one distribution. 

It is possible to talk about Lorenz curves for random 
variables that take negative values, in which case Prop- 
erty 4 would not necessarily hold. We only discuss non- 
negative random variables in this work. 

III. Martingale Lattice 

A. Feasibility 

Suppose one wishes to exchange a random variable 
Xi ~ px 1 for another random variable X 2 ~ px 2 using 
a system of fair gambles. The system may specify a 
different fair gamble, or sequence of gambles, for each 
value of X\, The following theorem states when such a 
system exists. 

Theorem 3.1: Given marginal distributions px t and 
px 2 with mean one, there exists a joint distribution 
PXi,x 2 with the property that E (X2\X\) — X\ if and 
only if Li(u) > Li2(u) Vu G [0,1], where L\ is the 
Lorenz curve for px 1 and L2 is the Lorenz curve for 
Px 2 - 



Proof: This theorem can be distilled from the 
literature. The concept of majorization, which applies 
to vectors, is related to the Lorenz curve. If a random 
variable X is formed from taking the uniform distri- 
bution over the elements of a vector x, and a random 
variable Y and vector y share the same relationship, then 
x majorizes y if and only if the Lorenz curve for Y is 
above the Lorenz curve for X. Based on this connection, 
the theorem can be established. However, we present a 
simple proof of the theorem here. 

For the converse part of the proof, assume that a joint 
distribution px lt x 2 nas me property that E (X 2 \Xi) = 
X\. Construct X\ = g(U) ~ px t where U is uniformly 
distributed on [0,1], as is used in the paragraph following 
<H). Also let U — Xi — X% form a Markov chain. 

For all u £ [0, 1], 

LAu) = min E X\\u^a 

A : P(U<EA)=u 

= E XilueA" 

= E E (X 2 \Xi) lueA* 

= EE{X 2 \X 1 ,U)l UeA . 

= EE(X 2 l UeA ,\X 1 ,U) 

= E X 2 lueA* 

> min E X 2 1 U£A 

PU\X 2 >A : P(U€A)=u 

= L 2 (u). 

The symbol A* refers to the argmin of the right-hand 
side of the first equality. The fourth equality is due to 
the construction of U having the Markov relationship 
U-Xi- X 2 . 

For the direct part of the proof, first note that 
£7[Jf2|Xi] = X\ is equivalent to the existence of a 
sequence of fair gambles that starts with wealth X\ and 
ends up with wealth X 2 . For simplicity, we provide a 
proof for discrete random variables with finite alphabets 
in this paper. The proof for random variables with 
general distribution will be provided in the full paper 
under preparation. 

The following lemma solves the case where X\ = 1, 
which by scaling, holds for X± = c for any constant 
c> 0. 

Lemma 3.2: Let X be a discrete random variable with 
support set {xi, ..,x n }, x\ < x 2 ... < x n , and probability 
mass function {pi, ...,p n }, satisfying x\ > and EX = 
c. There exists a sequence of fair binary gambles that 
starts with $c and ends up with wealth X. 

Proof: We give a proof by induction. For n = 2, the 
lemma automatically holds. Suppose the lemma holds for 
n = k. Consider the case n = k + 1. First, use a fan- 
binary gamble to generate a binary random variable V: 



If V = x%, we stop the gambling; if V = 
using the inductive assumption, there exists a sequence 

ith 

up with a random variable X 1 ~ (y^ 2 - 



l— pi 

. se 

of fair binary gambles that starts with 1 ~ r ^ lXl and ends 

-Pi ' 1— Pi ' '"' 1-Pl ' 

with support set {x 2 , x n }. This completes the proof 
of Lemma 13.21 ■ 




Fig. 3. Generate py from an upstream distribution px using a 
sequence of fair binary gambles. 



Suppose X and Y are two non-negative discrete 
random variables with finite alphabets {xi, x n } and 
{j/i, y n '}, respectively, and mean one, and that the 
associated Lorenz curves of X and Y are L\ and L 2 , 
respectively, where Li(u) > L 2 (u). Again, we will 
use induction to prove the result. Note that Lemma 13.21 
proves the case \X\ — 1, which serves as the starting 
case. Assume the direct part of the theorem holds for 
\X\ = k. Consider the case \X\ = k + 1. Extend the line 
segment of L 1 (u) for u G [px{xi),px{xi)+px(x 2 )] to 
the left, i.e., the region u < px(xi), until it intersects L 2 
at some point (u*, I*) The convexity of the Lorenz curve 
and the fact that L\{u) > L 2 (u) guarantee that such a 
point exists and u* < px{x\), illustrated in Fig. [3] Let 
to* = argmaxjj : Pr(F < Vj) < u*}, and define the 
distribution of V by 
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ptMLk i < m * - 1 

PX^!)' — — 

Px(xi) 
Px{xi)-u* 



V = 



1-piXt 

1-Pl ' 

Xi, 



w.p. 1 — p\ 
w.p. pi 



It is easy to check that EV = X\. In the first 
step of the gamble, if X ^ x\, we do nothing, if 
X = x±, we use x\ to generate the random variable V, 
which is feasible due to Lemma 13.21 By doing so, we 
exchange Xi for a random variable Y\ whose Lorenz 
curve coincides with L 2 for u £ [0, u*] and coincides 



with Li for u e [px{x\),l\- Note that Yi|Yi > x 2 
has a support set of cardinality \X\ — 1, which is equal 
to fc. Thus by the assumption of the induction, we can 
use YijYi > X2 to generate L%(u), u € (u*,l]. This 
completes the direct part of the theorem for discrete 
random variables with finite alphabets. ■ 

B. Lattice 

Theorem 13.11 gives a partial ordering of mean-one 
distributions, based on the Lorenz curve. We say that 
Pi h P2, or pi is upstream from p2, or pi is more 
degraded than p\ if L\ > L2 Vu G [0, 1]. 

The partial ordering of distributions according to the 
Lorenz curve gives a lattice. For any two mean-one 
distributions that are incomparable (the Lorenz curves 
cross), we can identify the last distribution (most de- 
graded) from which they each can be produced through 
a system of fair gambles. This will correspond to the 
point-wise maximum of the two Lorenz curves. We can 
also identify the first distribution (least degraded) that 
they can each produce. This is the greatest Lorenz curve 
below both. Due to the convexity of the Lorenz curve, 
this will be the lower boundary of the convex hull of the 
epigraphs of both Lorenz curves. 



Theorem 3.3: For a sequence {Xi} resulting from fair 
gambles, there exists a random variable X such that 




Fig. 4. This graph depicts the Lorenz curves of the least degraded 



distribution L 



Pa f\Pb 



below p a and p a from Example l 1.1 l and the most 



degraded distribution L pa y Pb above p a and p a . 



C. Martingale Convergence 

Let Xo, Xi,X2, ... be a sequence of random variables 
produced by fair gambles (wealths X n at times n = 
0, 1, 2, ...). Let Fo,Fx,... be the corresponding sequence 
of cumulative distribution functions and Lq(u), L±(u), ... 
their Lorenz functions. Then we have the following 
simple proof of the convergence of X n in distribution. 



X n 



X in distribution. 



(i.e. F n (x) — > F(x) at every point of continuity of F.) 

Proof: The sequence of Lorenz curves L n (u) is 
nonnegative, monotonically nonincreasing and thus has 
a limit L(u). F(x) is the corresponding limiting cu- 
mulative distribution function. In the case where L n (u) 
converges to zero for all u £ [0, 1), L(u) is not a Lorenz 
curve for lack of continuity, and the limiting distribution 
is zero with probability one. ■ 

D. Second Law of Martingales 

The second law of thermodynamics has a close rela- 
tionship to the behavior of entropy in Markov chains. For 
a Markov chain with a uniform stationary distribution, 
entropy always increases. However, this is not true for 
Martingales. 

Theorem 13.11 indicates that Martingales degrade in 
a different sense. The Lorenz curve cannot increase. 
A number of measures of inequality derived from the 
Lorenz curve, such as the Gini index, can only increase 
as a Martingale progresses. 

IV. Efficiently gambling down the Lorenz 

STREAM 

A. Air gambles 

Suppose the casino offers binary fair bets: 

2X, w.p. i, 
0, w.p. |. 

Suppose a gambler starts with $1 and wants to achieve 
a uniform distribution unif(0, 2) on his wealth after 
gambling. There is more than one possible gambling 
strategy to achieve this: 
Method 1: 

. Bet $|, then bet $±, then $±, .... 

• After infinite bets the distribution of wealth is 
unif(0,2). 

In this method, the gambler must place on the casino's 
table a total of: 

1 1 1 

2 + 4 + 8 + - 
Method 2: 

> Generate Y ~ unif(0, 1). 
. Bet $Y. 

In this method, the gambler used "air gambles" to 
randomize the amount of money placed on the table and 
supplement the "physical gambles" implemented by the 
casino, resulting in a reduced expected amount of money 
placed on the table, 

E{Y) = \<1. 



X 



1. 



B. Optimal efficiency 

We now ask "What is the minimum volume of bets 
to be placed on the table to achieve a desired target 
distribution?" We characterize the optimal efficiency for 
certain special cases of the target distribution. 

Theorem 4.1: Suppose a casino offers all fair binary 
gambles, i.e. Vc e (0, 1), the casino offers 



c -> 



1, w.p. c, 
0, w.p. 1 — c, 

which can be scaled up or down (both cost and payout) 
as desired. Then, the minimum expected amount that one 
has to place on the casino's table in order to gamble from 
x to or 1 is given by V*(x) = — (1 — x) ln(l — x) < x. 

Proof: 
Achievability: 

Suppose we start with x. Here is a strategy to gamble 
to or 1: 

1) Choose n large enough and let 5x = ^a. 

2) The algorithm is n steps long. Let x n denote the 
wealth before the nth step. If x n ^ 1, make the 
following gamble: 

8x 

Sx ->■ 

w.p. 



1 — x n +5x 
Y 5x 



-x n +Sx 

3) If the gamble succeeds, we are left with 1. If it 
fails, we are left with (x — Sx) and we continue 
the process. 

Let V(x) denote the total volume of bets placed on 
the table to reach or 1 from x. Then, 

Sx 



V(x) = Sx + 



1 - 



V(x-5x). 



1 — x + Sx 

In the limit as Sx — > 0, this can be recast as a differential 
equation 



dV{x) 



V(x) 



dx 1 — x 

Solving this differential equation with boundary condi- 
tions V(0) = V(l) = gives 

V(x) = -(1-x) ln(l-z). 

Converse: 

We only outline the proof of the converse. In the above 
achievability scheme, we used only "physical" gambles. 
We did not use "air" gambles. So we must first reason 
why air gambles are not needed. Note that the target 
distribution 



X 



1. 



w.p. 



0, w.p. 1 



is not a convex combination of two or more different 
distributions with mean 1. In other words, it is an 
"extreme" distribution: an extreme point of the convex 



set of probability distributions with mean 1. A sequence 
of "air" gambles is simply a convex combination of 
"physical" gambles. Since the target distribution X con- 
sidered above is an extreme distribution, every physical 
gambling tree has to achieve X on termination. The 
volume of bets in an "air" gambling scheme is the 
average of the volume of bets in physical gambling 
schemes that constitute the air gambling scheme. It 
follows that we need only minimize the volume V*(x) 
over physical gambling schemes. 

Now to establish optimality over physical gambles, we 
use a dynamic programming approach similar to [13]. 
According to the Bellman equation, we only need to 
show that V* satisfies 

V*(a+(b-a)6) < (b-a)6 + 

(l-9)V*(a) + 9V*(b) 

whenever < a < b < 1, and < 6 < 1. This can be 
proven using some algebra and calculus. 

The left hand side of the above inequality simply 
uses the scheme and expends a volume V*. The right 
hand side deviates from V* for only the first time step 
by playing an arbitrary gamble (b — a)6 — > (b — a). 
The inequality suggests deviating from V* is worse. 
By invoking this dynamic programming argument, we 
conclude that V* is optimal. ■ 
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